Reconngurable Computing: Architectures, Models and Algorithms Current Science: Special Section on Computational Science
نویسندگان
چکیده
ion Mapping Parameters Scheduling Figure 2: Traditional Design Synthesis Approach and the Model-based Approach One major problem in using FPGAs to speed-up a computation is the design process. The \standard CAD approach" used for digital design is typically employed (see Figure 2). The required functionality is speci ed at a high level of abstraction via an HDL or a schematic. FPGA libraries speci c to a given device (e.g. Xilinx, Altera, etc.) and time consuming placement and routing steps are required to perform the logic mapping. This approach of logic synthesis as opposed to algorithm synthesis allows the user to specify the design using a behavioral model. But this abstraction is achieved at the expense of performance. The semantics and nature of the algorithm are lost in the mapping phases. The model based mapping environment takes into account the capabilities and limitations of current as well as projected hardware technologies. In Section 4 we illustrate some of the models and describe algorithmic approaches in Section 5. Parameterized models for algorithm design and analysis will possess the following characteristics: Cost models for analysis of recon gurable architectures. Techniques for partitioning and placement of designs exploiting algorithm and input structure. Cost analysis incorporating the cost of recon guration and partial and dynamic recon gurability. Impact of o -chip communication in designing recon gurable computing solutions. Tradeo s between recon gurability and redundancy of hardware. 5 Logic Configure Execute Execute Design Configurations Design Configurations Figure 3: Static con gurable computing Utilizing FPGAs for speeding-up applications has been mostly limited to developing con gurations which optimize the computation time for a given task. The optimized con guration is then used to execute the task. This process is illustrated in Figure 3. A given computational task is analyzed and an optimized con guration is developed for that computational task. The con gurable logic device is then con gured, usually under the control of the host, with this optimized con guration. Finally the con guration is executed by initiating the computation and communicating the data to the device. The programmability of the device is not exploited and the logic resources are not reused during a computation. Execute Design Configurations Design Logic Configure Configurations Execute Figure 4: Dynamic con gurable computing The conventional approach is static, because the hardware is con gured just once, followed by execution. The concept of dynamic con gurable computing is illustrated in Figure 4. The con gurable resources are reused by recon guring the hardware after a computation is completed. The con guration of the logic and the interconnection network are adapted on the y during the execution. The run-time recon guration can be based on intermediate results generated by the computations. This approach has enormous opportunities to achieve higher performance than conventional approach by closely adapting the hardware to the nature of the computation. 4 Recon gurable Computing Models Bridging the semantic gap between the algorithm and the hardware by using such a model allows the user to develop recon gurable computing solutions in a natural manner. We discuss here brie y a theoretical and a practical model which have been developed to map computations onto recon gurable architectures. 4.1 Recon gurable Mesh Model Recon gurable Mesh is a theoretical model of a VLSI array of processors overlaid with a recon gurable bus architecture [44, 45]. An overview of various models and architectures designed based on the model is given in [8]. A recon gurable bus architecture consists of a multi-dimensional array of processing elements (PEs) connected to a bus through a xed number of I/O ports. This bus architecture is capable, on a per instruction basis, of con guring a topology that contributes to solving the problem at hand. Bus recon guration is achieved by locally con guring the switches within each PE. Di erent shapes of buses such as rows, columns, diagonals, zig-zag, and staircase can be formed by con guring the switches/ports. 6 A two dimensional processor array with a recon gurable-bus system of size MN consisting of identical processors connected as a M N rectangular mesh system is called a recon gurable mesh. An example of a 4 4 recon gurable mesh is shown in Figure 5. A set of four I/O ports labeled N, E, W and S, connect each PE to its four neighbors to the north, east, west and south, respectively. Each PE has locally controllable switches which con gure the connection patterns between the four I/O ports. The switches allow the broadcast bus to be divided into sub-buses, providing smaller recon gurable meshes. The bus and all I/O ports are assumed to be m-bit wide. The connection patterns are represented as fg1,g2,...g, where each of gi represents a group of switches connected together. For example fNS,E,Wg represents the connection pattern with N and S connected and E and W unconnected. N Ε W S 0 1 2 3 0 1 2 3 PE(2,3) switch : Figure 5: Recon gurable Mesh. The basic computational unit of the recon gurable mesh is the Processing element (PE) which consists of a switch, local storage and an ALU (Fig. 5). In a unit time, a PE can perform: 1. Setting up of a connection pattern. 2. Read from or write onto a bus or local storage. 3. Logical or arithmetic operations on local data. Various models of recon gurable meshes have been proposed in the literature. Most of these models are synchronous in nature and permit unconditional global switch setting in addition to local switch control. Unconditional global switch setting is performed by the broadcast of a global instruction from a central controller. Recon gurable mesh models can be characterized by several parameters such as data width of the PE, signal propagation delay, shared/exclusive access to the bus, switch connection patterns, among others. 4.2 Hybrid System Architecture Model (HySAM) Hybrid System Architecture Model (HySAM) is a parameterized model of a con gurable computing system, which consists of con gurable logic attached to a traditional microprocessor. The HySAM model cleanly partitions the capabilities of the hardware from the implementations and presents a very clean interface to the user. Figure 6 shows the architecture of the HySAM model and an example of an architecture. The architecture consists of a traditional microprocessor, standard memory, con gurable logic, con guration memory and data bu ers communicating through an interconnection network. More details of the model can be found in [6, 9]. 7
منابع مشابه
Special Section on Two Themes: CSE Software and Big Data in CSE
The 2015 SIAM Conference on Computational Science and Engineering (CSE) was held March 14-18, 2015, in Salt Lake City, Utah. The SIAM Journal on Scientific Computing (SISC) created this special section in association with the CSE15 conference. The special section focuses on two topics that are of significant current interest to CSE researchers: CSE software and big data in CSE. The development ...
متن کاملLITERATURE REVIEW: Parallel Computational Geometry on Multicore Processors A Review of Basic Algorithms
Parallel Computing has been one of the most important problems in computer science as well as a solution to satisfy the growing need of computation speed. Although in the past decades, parallel computing has been an interested area, it is going to be more popular and useful among simple users. In the past years, the area was strange for home users due to the high cost of installing a parallel c...
متن کاملFrom molecule to man: mesoscopic models of living cells, ongoing projects within the Section Computational Science (UvA)
In these pages a brief overview is given of the expertise within the section computational science on modelling and simulation methods, High Performance computing and scientific visualization and virtual reality, applied in computational medicine. We will provide here a short overview of current collaborations and ongoing projects. A major challenge in developing models of systems from biology ...
متن کاملA feedback guided interface for elastic computing
Computer Science plays an important role in today’s Genetics. New sequencing methods produce an enormous amount of data, pushing genetic laboratories to storage and computational limits. New approaches are needed to eliminate these shortcomings and provide possibilities to reproduce current solutions and algorithms in the area of Bioinformatics. In this paper a system is proposed which simplifi...
متن کاملIntroduction to Model Order Reduction
In this first section we present a high level discussion on computational science, and the need for compact models of phenomena observed in nature and industry. We argue that much more complex problems can be addressed by making use of current computing technology and advanced algorithms, but that there is a need for model order reduction in order to cope with even more complex problems. We als...
متن کاملGeography and Computational Science
n many U.S. campuses, a rising tide of scholarly activity, called computational science , is being recognized by academic administrators through the establishment of research centers and degree programs (Rice 1994). This nascent discipline has emerged from several traditional fields, including for example, cosmology, geography, and pharmacology, that contain subareas in which computing is the p...
متن کامل